INFORMATION AND CONTROL 9, 177-189 (1966) 


Simple Self-Reproducing Universal Automata 

Michael A. Arbib 

Division of Engineering Mechanics, Stanford University, Stanford, California 

von Neumann and Thatcher have shown that one may construct 
self-reprodueing universal arrays using as basic cells finite autom¬ 
ata with only 29 States. The simplicity of the components necessi- 
tates complex programming. 

We present a self-reprodueing universal array with simple program¬ 
ming. This is made possible by using as basic unit a finite automaton 
which can execute an internal program of up to 20 instructions. 

I. CT-MACHINES AND THEIR EMBEDDING IN A TESSELLATION 

1.1. CT-machines (Thatcher, 1965) combine the functions of a 
W -machine (the programmed Version of a Turing machine introduced by 
Post (1936), Wang (1957), and Lee (I960)) and of a construction 
machine which is a print-only machine with a half-plane for its tape. The 
constructing arm operates on one square at a time, where it can print one 
symbol from the alphabet V c ; the arm may also be instructed to move 
one square up, down, feft or right. (See Fig. 1.) 

A CT-machine is programmed with a finite list of instructions from the 


following: 

C-instructions 

u, d, r, l 

move arm up, down, right, left 

(constructing): 

C{x) 

build x in the scanned square 

T-instructions 

+ 

move tape left one square 

(tape—or 

— 

move tape right one square 

Turing!): 

e 

erase scanned square (print 0) 


m 

t + (w), 
t - ( n ) 

mark scanned square (print 1) 

If instruction k is t ± (n), and the 
scanned tape square is marked, 
proceed to instruction k ± n; if 


* This research was supported by the Air Force Office of Scientific Research 
under Contract AF 49(638)-1440. 
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Given a square (m, n), we say it has four neighbors, the cells with Co¬ 
ordinates (m + 1 , n), (to — 1 , n), (to, n + 1), and (m, n — 1). 

von Neumann (1951,1965) and Thatcher (1965) have shown that one 
may construct self-reproducing universal arrays using as basic cells finite 
automata with only 29 states. The price we pay for the simplicity of the 
components is that the coding of the array is enormously complicated, 
and the Operation of the array requires many many steps to simulate one 
cycle of an ordinary Turing machine. 

Our purpose here is to present a self-reproducing universal array with 
simple coding, and a time scale which is (except for transfer instructions) 
similar to that of an ordinary Turing machine. The price we pay for the 
simplicity of programming and Operation is that our cells are more com¬ 
plicated—the basic unit is a finite automaton which can execute an 
internal program 2 of up to 20 instructions. 

Our basic module could, of course, be decomposed into simpler com¬ 
ponents. However, adopting a hierarchical (or biological ) 3 view of Systems 
Organization, the present construction is satisfactory without undertak- 
ing a module decomposition—in fact, it is preferable. 

There is one other factor, besides the program-controlled behavior of 
the modules, which drastically simplifies our task. This is the use of a 
“welding” Operation (suggested by von Neumann in his 1948 Princeton 
lectures—see Burks (1960)—but abandoned in his tessellation model; it 
was used by Myhill (1963) in his outline of the design of a self-reproduc¬ 
ing C-machine, as distinct from CT-machine) whereby cells can be 
formed into aggregates which may be moved about the plane en masse. 
Thus, in distinction to the von Neumann-Thatcher model, the tape of a 
Turing machine will actually be modeled as a one-dimensional string of 
cells welded together so that they can be moved left or right on command. 

Consider the following excerpt from Burks (1960). “Someone ob- 
jected to von Neumann that the problem of self-reproduction can be 
trivialized by assuming an element sufficiently complicated, or by de- 
fining self-reproduction suitably. For example, in a cellular model, we 
could have two states, quiescent and excited, and stipulate that each 
excited state excites its neighbor, thereby reproducing itself.” The 
point of our construction is not that very simple or very complex com- 

2 We shall henceforth refer to the internal programs as i-programs to avoid 
confusion with the program of the CT-machine. 

3 The relationship of this work to work on parallel computation in tesselations, 
and some comments on biological Systems, will appear in the Proc. Conf. Com¬ 
puter Sei. Systems held at the University of Western Ontario, September, 1965. 
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ponents can be used to build a self-reproducing automaton; but rather 
that, given components of one level of complexity, we may use them to 
obtain self-reproducing aggregates of an arbitrarily higher level of com¬ 
plexity (e.g., a universal Turing machine)—complexity being in the 
senseof Rabin (1960), Ritchie (1963), and Arbib and Blum (1965). 

1.2. Our basic cell has the structure shown in Fig. 2. There are input 
and output channels, and weld positions, in each of the four directions, 
a bit register BR, and 20 registers to hold the f-program. 

The hatching denotes the combinatorial circuitry which combines the 
inputs and the setting of the registers and welds at time t to determine 
the move between t and t + 1; new register settings and output of the 
module at time t + 1. 

The quiescent state is that in which all 25 registers (including the 
four weld registers) are set to zero. 

1.3. Each weld may be on (state 1) or off (state 0); and two neighbor- 
ing cells are said to be welded if either of the welds on their common 
boundary is in state 1, 
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Let W be the ancestral relation of welding on cells, i.e., 

W(a, b) <=> a and b are welded neighbors 

or 3 c such that W(a, c) and W(c, b). 

A collection C of cells is called comoving if a 6 (7 =>: & €; C W(a,b). 

We make the following convention: 4 If any cell in a comoving set C 
receives at time t an instruction to move in direction x (u, d, l, or r), 
and no cell in the set receives at time t an instruction to move in any 
direction other than x, then a cell a of C will move one square in the 
direction x if its neighbor at time t in direction x is either quiescent (all 
registers, including welds, in 0 state) or belongs to C. If the cell is occupied 
at time t by a cell not in C, and it does not belong to a comoving set 
which has been instructed at time t to move in direction x, then our 
cell “disappears,” i.e., the state of the neighboring square in direction 
x remains unchanged. If two comoving sets both tend to move into a 
given square at time t, then that square is to be blank at time t + 1. 

The behavior of a module between times t and t + 1 will be governed 
by the state of its four neighbors at time t, with the sole exception of the 
move operations described above. 

1.4. The over-all plan of the machine is very simple, as shown in 
Fig. 3. The CT-program consists of a linear string of comoving cells, 
partitioned into substrings of 2:1 adjacent cells the left-most of which 
has (BR) = 1, the remainder of which (if any) have (BR) = 0. The 
mth such substring from the left represents the mth instruction of the 
CT-program, and contains one cell unless the mth instruction is t ± (n), 
in which case it contains n =F 1 cells. 

The tape head, comprising two cells, serves: to read T-instructions 
from the program; to execute them; to initiate construction operations 
above the program cell above it; and to move the CT-program. The 
tape consists of a linear string of comoving cells, one for each square of 
the tape of the simuland—with (BR) of a cell being 1 or 0 as the cor- 
responding square of the simuland is marked or not. 

4 1 have rather loosely talked of a cell moving in direction x when in fact it is 
the contents of the registers that are moved. This should cause no confusion. 
Thus, a comoving set really comprises a pattem, rather than actual cells of the 
tessellation. 
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CONSTRUCTiNG 
AREA 



TAPE AREA 


l 


TAPE AREA 


COMPUTATION TAPE 
Fig. 3. Over-all plan of embedded CT-machine 


II. i- PROGRAMMING THE MODULES 

Given the above framework, one can generate many schemes for i-pro- 
gramming the modules. The reader who is convinced of this may skip 
the details of this section. 


2.1. The Order Code of the Modules. We shall write the various module 
instructions out in semiEnglish—in building an actual module one would 
oode these instructions in some compact albeit unintelligible way. In 
what follows A may take on the direction values u, l, d, r, or lr\ b € {0,1} 
denotes the contents of BR or a weld register; k, k' 6 {BR, 1 , 2, • • • , 20} 
denote registers. An instruction of the form “(A)x” teils the module to 
emit on its output in direction A the Order x. 


“weld Ah” 

“emit Ak” 

“move A” 

“A = 0:YES(fc), 
NO(fc')” 

“go to k” 

A)k ; k' ” 


teils the module to change the state of its A 
weld (or welds) to b 

teils the module to emit in direction A .the con¬ 
tents of its k register 

teils the module to move (subject to the limita- 
tions of Section 1.3) in direction A 
If the input from direction A equals 0, execute 
instruction k next; if not, execute instruction 
k' next. 

teils the module to next execute instruction k 
(it may be regarded as an abbreviation for 
“A = 0: YES(fc), NO(fc)”) 
executed by a module a causes the contents of 
a’s register k to be stored in the register k' 
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of the A neighbor of a 

“A place b” teils the module in direction A to place b in its BR 

“stop” This is overridden by an input instruction but 

will be executed when control of inputs lapses, 
unless inputs transfer control to some other 
instruction of the internal program. 

This completes our list. Several of these instructions could be avoided 
by reprogramming. Other instructions could be added to make our cell 
aggregates more useful and economical for executing tasks other than 
that which occupies us here. 

To complete our specifieation we should say what happens if a module 
receives contradictory instructions from its neighbors. However, we 
leave this open since we shall not need to invoke such a Convention in the 
programming that follows. 

2.2. Instructions m, e , +, —. These instructions are coded by a four 
instruction t-program in the first four registers in the CT-program 
square (P (which, of course, has BR set to 1): 

(P: 1. (d) 4, 5 

2. (d) go to 4 

3. stop 

The contents of register 4 depend on the instruction: 

m: (d ) place 1 
e: (d) place 0 
+ : ( d ) move l 
— : (d) move r 

(P will be above the first square of the control head Gi which will act 
by activating the (P, which loads the appropriate f-instruction in register 
5 of Gi, which then proceeds to make sure that the tape square below it 
is welded into the tape, execute the tape instruction, and then advance 
the program tape, and activate the new CT-program square <P, thus 
completing the cycle: 

Ci : 1. ( u) move l 4. (d) weld Ir 1 

2. ( u) go to 1 5. (to be loaded by (?) 

3. stop 6. go to 1 
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2.3. Instructions t ± ( n ). Most of the logic is contained in the second 
square 62 of the control head. A small routine in 6\ loads Instruction 
15 of e 2 , telling whether the transfer is left or right: 

(P: 1. ( d)l 

2. ( d) 5, 15 

3. ( d ) go to 1 

4. stop 

The contents of register 5 depend on the instruction: 

t+: (u) move l 
t— : (u) move r 

Now, for a transfer instruction t ± (n), (Pj is followed by k = n =Fl 
squares with their BR set to 0. The first job of C 2 is to test whether the 
scanned tape square is 1—if not, it simply advances the CT-program 
to the next instruction, and returns control to 6i : 

e 2 : 1. (d) emit u BR 5. u = 0: YES(3), NO(6) 

2. d = 0: YES(3), NO(9) 6. move r 

3. (u) move l 7. (I) go to 2 

4. ( u) emit d BR 8. stop 

If the tape square is marked, e 2 has now to generate on its right a 
string of k squares with Wr (the right Weid register) set to one, so 
that it may remember k while moving the CT-program tape: 

e 2 : 9. (u) move l 12. (r) move l 

10. (u) emit d BR 13. (r) weld r 1 

11. u = 0: YES(12), NO(15) 14. go to 9 

Having done this, (S2 has advanced the CT-program tape by one in¬ 
struction string. So for t + ( n) it must move the CT-program tape a 
further k — n — 1 instruction strings left; for t — (n) it must move the 
CT-program tape k = n + 1 instruction strings right: 

e 2 : 15. ( r) emit l Wr 18. (to be loaded by (Pi : 

(u) move l/r) 

16. r = 0: YES(6), NO(17) 19. (u) emit d BR 

17. (r) move l 20. u — 0: YES(18), NO(15) 

We are at a loss to use the “simpler” switching methods of digital com- 
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puters, since we have no a priori bound on the number of instructions, 
and thus (because of transfers) on the length of these instructions. 

Note that it is the number 20 of instructions in the program of C 2 
that determined our choice of 20 as the number of instruction registers 
in each module. If we had allowed more instructions, we could have 
used a one module control head. Using more squares in the control head, 
and some ingenious reprogramming, we can reduce the number of 
registers per module. 

2.4. Construction Instructions. We do not have a constructing arm 
as in the CT-machine of Section 1.1. Rather we construct (or print) 
only in the square up two from Ci, and then move constructed cells 
about in the construction area by appropriate i-programming. 

“Construction” then oceurs in the square above the program square 
above Ci. If we want to load b in BR, all 20 instructions, set all four 
welds of the construction square, and then move the constructed cell 
one square in some direction, we may do it with three CT-program cells 
<Pi, (P 2 , (Pa, in that order. 


1. (u) 11, 1 

(Pa : 1. <u> 11, 9 

(P* : 1. (u) 13, 17 

8. (u) 18, 8 

8. <«) 18, 16 

4. <«) 16, 20 

9. (d) go to 1 

9. (d) go to 1 

5. (u) place b 

10. stop 

^ 1 '] instructions 

’ fto be loaded 
lo.J 

10. stop 

1L | instructions 
lg j to be loaded 

ö.] up to four 
• instructions 

9. j (u) weld Ab 

10. (u) move A 

11. (d) go to 1 

12. stop 

1 [instructions 

■ fto be loaded 
16.J 


III. THE BASIC CONSTRUCTION RESULTS 

All the hard work was done in Section 2. We can now use our under- 
standing of effective procedures to deduce the results that could form 
the basis for an axiomatic treatment of self-reproduction like that given 
by Myhill (1964). 

By an embedded CT-automaton we shall mean a control head together 
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with a finite program and finite tape, appropriately positioned, coded as a 
tessellation configuration in the manner described above. 

We state, rather imprecisely, 

Theorem 1. Any CT-automaton may be effeetively represented as an 
embedded CT-automaton. 

Proof: Section II, Q.E.D. 

Corollary. Any Turing machine—thus any effedive computation — 
can be embedded in the tessellation. 

Theorem 2. There is an effedive procedure whereby one can find for a 
given CT-automaton ft an embedded CT-automaton c(<3) (read “construdor 
of ft”) which when started (by telling its control head to “go to 1”) will 
proceed to construd ft in the three rows of its construding area immediately 
above it, and adivate ft by telling its control head “go to 1.” 

Proof: c(ft) merely needs a control head and a CT-program but no 
tape. If ft has a program of length n, then at most the first 3 n instruc- 
tions of the program of c(ft) sufhce to construct this program, in the 
first row of the constructing area, as a comoving set. This is then moved 
left or right the appropriate number of squares, and then up one square. 

We then move the program of ft up a further square by secreting a 
cell with an up weld, move it up one carrying the program, then build 
a cell which destroys its weld. The program square of c(ft) bears the 
f-program, for the latter square, as shown below. 


1. (u) 8, 1 

2. (u) 9, 2 

3. (u) 10, 3 

4. (u) go to 1 

5. stop 


6. (d) go to 1 

7. stop 

8. (u) weld u 0 

9. ( d) go to 6 

10. stop 


The control head of ft is then built and moved up one, after which the 
tape of ft is constructed as a comoving set and moved left or right to 
Position it appropriately. The final square of the program of c(ß) 
(or, at least, of that part used in the construction of ft) then serves to 
activate the control head of ß: 6 


1. (u) 5, 1 

2. (u) 6, 2 

3. ( u) go to 1 


4. appropriate transfer of control instruction: e.g., 
(d) go to 1, or stop 

5. (u) go to 1 

6. stop 

Q.E.D. 


5 The two instructions loaded in the tape square of ft are irrelevant to its per- 
formance as a tape square. 
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Thanks to these results we may argue about CT-machines whose 
construction alphabet is V c = {the set of different register configurations 
of a module of our tessellation}, and then embed them. 

U G V c represents a module with all (including weld) registers in 
state 0. 

Consider recursive function theory on V c , the set of finite configura¬ 
tions of nonquiescent squares in the tessellation, i.e., the set of functions 
/:/ X I —* V c such that the cardinality of {(m, n) | f(m, n) X U) 
is finite. 

Clearly this set is denumerable; we have an effective procedure for 
going from a CT-automaton ft to an element f a of V c , the embedding 
of ft, and we may find a birecursive function (Arbib, 1965) h: V c —> {0, 1} *, 
which effectively assigns a distinct tape configuration to each embedded 
CT-automaton. 

Now there is a recursive function for going from A(/ a ) to/ a . Thus, by 
Turing’s thesis (Arbib, 1964, Sec. 1.6) in a CT-Version, we may deduce, 
following Thatcher (1965, Sec. 8). 

Theorem 3. For each birecursive function h:V c —* {0, 1}*, there exists 
a CT-machine which not only can function as a universal Turing machine, 
but when started scanning the left-most end of h(f) on its tape, will proceed 
to print the configuration f in its construction area. 

Now consider the machine C" which has a slightly longer program: 
namely before constructing /, it copies its tape into the right position. 
It then constructs / (the programming has to be slightly modified to 
stop the tape from obstructing this construction-—this is left as an 
exercise for the reader) and positions the tape correctly. Considering 
C" with the description of C" on its tape, we conclude with 

Theorem 4. There exists a self-reproducing universal array embeddable 
in our tessellation. 
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